Alert button
Picture for Istvan Szita

Istvan Szita

Alert button

Exploring compact reinforcement-learning representations with linear regression

Add code
Bookmark button
Alert button
May 09, 2012
Thomas J. Walsh, Istvan Szita, Carlos Diuk, Michael L. Littman

Figure 1 for Exploring compact reinforcement-learning representations with linear regression
Figure 2 for Exploring compact reinforcement-learning representations with linear regression
Figure 3 for Exploring compact reinforcement-learning representations with linear regression
Figure 4 for Exploring compact reinforcement-learning representations with linear regression
Viaarxiv icon

Optimistic Initialization and Greediness Lead to Polynomial Time Learning in Factored MDPs - Extended Version

Add code
Bookmark button
Alert button
Apr 21, 2009
Istvan Szita, Andras Lorincz

Viaarxiv icon

Factored Value Iteration Converges

Add code
Bookmark button
Alert button
Aug 13, 2008
Istvan Szita, Andras Lorincz

Figure 1 for Factored Value Iteration Converges
Viaarxiv icon

Online variants of the cross-entropy method

Add code
Bookmark button
Alert button
Jan 14, 2008
Istvan Szita, Andras Lorincz

Figure 1 for Online variants of the cross-entropy method
Figure 2 for Online variants of the cross-entropy method
Figure 3 for Online variants of the cross-entropy method
Viaarxiv icon

Reinforcement Learning with Linear Function Approximation and LQ control Converges

Add code
Bookmark button
Alert button
Mar 09, 2007
Istvan Szita, Andras Lorincz

Figure 1 for Reinforcement Learning with Linear Function Approximation and LQ control Converges
Figure 2 for Reinforcement Learning with Linear Function Approximation and LQ control Converges
Figure 3 for Reinforcement Learning with Linear Function Approximation and LQ control Converges
Viaarxiv icon

Low-complexity modular policies: learning to play Pac-Man and a new framework beyond MDPs

Add code
Bookmark button
Alert button
Oct 30, 2006
Istvan Szita, Andras Lorincz

Figure 1 for Low-complexity modular policies: learning to play Pac-Man and a new framework beyond MDPs
Viaarxiv icon

Kalman filter control in the reinforcement learning framework

Add code
Bookmark button
Alert button
Jan 09, 2003
Istvan Szita, Andras Lorincz

Viaarxiv icon

Temporal plannability by variance of the episode length

Add code
Bookmark button
Alert button
Jan 09, 2003
Balint Takacs, Istvan Szita, Andras Lorincz

Figure 1 for Temporal plannability by variance of the episode length
Figure 2 for Temporal plannability by variance of the episode length
Figure 3 for Temporal plannability by variance of the episode length
Figure 4 for Temporal plannability by variance of the episode length
Viaarxiv icon

Searching for Plannable Domains can Speed up Reinforcement Learning

Add code
Bookmark button
Alert button
Dec 10, 2002
Istvan Szita, Balint Takacs, Andras Lorincz

Figure 1 for Searching for Plannable Domains can Speed up Reinforcement Learning
Figure 2 for Searching for Plannable Domains can Speed up Reinforcement Learning
Figure 3 for Searching for Plannable Domains can Speed up Reinforcement Learning
Figure 4 for Searching for Plannable Domains can Speed up Reinforcement Learning
Viaarxiv icon